Synchronous TFG for speech translation

نویسنده

  • Marc Cavazza
چکیده

The use of synchronous TAG for Machine Translation has been described by Abeille et al. [1990] and has resulted in several implementations [Prigent, 1994] [Egedi et al., 1994], mainly developed using the XTAG system [Paroubek et al., 1992]. While we subscribe to the general arguments in favour of the use of TAG+ for Machine Translation, it appears that speech translation could constitute an ideal application of these ideas [Harbusch & Poller, 1994]. lt is actually easier to select specific areas where speech translation is both feasible and of practical impact (see e.g. the CST AR, VERBMOBIL and SRI Speech Translation projects). In this paper, we report the implementation of a minimal speech translation prototype based on synchronous TAG+ (more exactly, synchronous Tree Furcating Granunars or STFG), which has been developed as a direct extension of our TFG parser [Cavazza, 1998]. Sheiber & Schabes [1990] originally coined the tenn "synchronous TAG". They described synchronous derivation of semantic structures from tree operations carried on lexicalised trees. A synchronous TAG is thus a pair of two elementary trees, one representing the source language and the other a logical formula, which is also represented as a variant of TAG (and is lexicalised as weil). However, the term of synchronous TAG, when used for machine translation, actually subsumes different approaches and deserves some clarification. The initial presentation of TAG for Machine Translation by Abeille et al. (1990] referred to synchronous TAG, though in fact it directly mapped lexicalised trees to one another, without making recourse to the "semantic" trees described by Shieber & Schabes [1990]. In that sense, it could be considered as a transfer formalism or a structural correspondence system [Kaplan et al„ 1989]. Further implementations by Prigent [1994] within the XTAG system [Paroubek et al„ 1992] have been based on an extended transfer paradigm, mapping between derivation trees in the source and target languages, thus introducing an intermediate representation. On the other hand, direct mapping between lexicalised trees has also been adopted in the STAG project [Egedi et al„ 1994] [Egedi & Palmer, 1994]. We would like to suggest, adopting a terminology from Prigent [1994], that approaches based on the direct mapping between lexicalised trees should be renamed iso-synchronous. This would clearly indicate that the synchronous trees on which adjunction (resp. substitution) operations are carried are of the same kind. Our own implementation follows the isosynchronous approach, and is based on paired elementary trees in the TFG formalism [Cavazza, 1998].

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ced-4 and Proto-Oncogene tfg-1 Antagonistically Regulate Cell Size and Apoptosis in C. elegans

BACKGROUND Cell-size-control systems, coupled with apoptotic- and cell-proliferation-regulatory mechanisms, determine the overall dimensions of organs and organisms, and their dysregulation can lead to tumor formation. The interrelationship between cell-growth-regulatory mechanisms and apoptosis during normal development and cancer is not understood. The TRK-fused gene (TFG) promotes tumorigene...

متن کامل

The Effect of Private Speech and Self-Regulation on Translation Quality among Iranian Translation Students: A Mixed-Methods Study

The current study presents findings from a mixed-methods study of investigating the self-regulatory role of private speech (self-talk) on students’ translation quality. The aim of the study was to validate the adapted version of a self-verbalization questionnaire. The construct validity and reliability of the scale were supported by the CFA which revealed that all items reached the acceptable f...

متن کامل

Using syntax in large-scale audio document translation

Recently, the use of syntax has very effectively improved machine translation (MT) quality in many text translation tasks. However, using syntax in speech translation poses additional challenges because of disfluencies and other spoken language phenomena, and of errors introduced by automatic speech recognition (ASR). In this paper, we investigate the effect of using syntax in a large-scale aud...

متن کامل

An Open Web Platform for Rule-Based Speech-to-Sign Translation

We present an open web platform for developing, compiling, and running rulebased speech to sign language translation applications. Speech recognition is performed using the Nuance Recognizer 10.2 toolkit, and signed output, including both manual and non-manual components, is rendered using the JASigning avatar system. The platform is designed to make the component technologies readily accessibl...

متن کامل

MT based on Probabilistic Synchronous Dependency Insertion Grammar (PSDIG)

This proposal is an extension to the project “Generation in the Context of Machine Translation”, which is one of the projects done in the Johns Hopkins University Center for Speech and Language Processing 2002 summer workshop (WS02GMT). We first address some issues observed in WS02GMT which possibly leads to detrimental effects in machine translation performance. And hence we propose a new appr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998